Abstract

Random walks, and in particular their first passage times, are ubiquitous in nature. Using direct
enumeration of paths, we find the first return time distribution of a 1D random walker, which is a
heavy-tailed distribution with infinite mean. Using the same method we find the last return time
distribution, which follows the arcsine law. Both results have a broad range of applications in physics
and other disciplines. The derivation presented here is readily accessible to physics undergraduates,
and provides an elementary introduction to random walks and their intriguing properties.
Thermal and statistical physics texts often begin with a discussion of random walks and their
associated applications such as Brownian motion, polymer physics, and laser cooling of atoms.
The first passage time distribution $F(r,t)$, i.e. the distribution of times $t$ at which a target $r$ is
first reached, stands out as central to many natural phenomena such as quenching of fluorescent
molecules, molecular rupture (times over which molecules dissociate in, e.g., ligand-receptor
complexes), and target site searches (e.g. transcription factors finding corresponding binding sites
along DNA), as well as additional problems in biology. In the context of finance, an optimal
trading strategy might be to sell an asset when it first reaches a threshold value.

Insofar as undergraduate texts give an impression that the bell-shaped curve rules the world,
first passage time distributions supply a nice counterexample by their heavy-tailed behavior, e.g.,
$F(t) \sim t^{-3/2}$ in 1D. Here we supply an elementary derivation of this result by examining first
returns on an infinite 1D lattice. While general results are available for first return distributions on
infinite $d$-dimensional lattices, derivations rely on generating functions or Laplace transforms which
may be unfamiliar to undergraduates. In contrast, the approach below yields the power law after just a
few lines of mathematics by entirely elementary means. It can thus serve as a friendly primer to
random walks, first passage, recurrence, and heavy-tailed distributions.

A 1D random walk is a succession of $N$ steps to the right or left with respective probabilities $p$
and $q = 1 - p$, occurring at every time interval $\Delta t = \tau$ (hence $N = t/\tau$). We focus on the
case of a symmetric walk ($p = q = 1/2$) but the same formalism may be applied to biased walks ($p \neq q$).
All walks considered here begin at the origin ($r = 0$) at time $t = 0$ and have steps of identical
length 1. The first return time is the time at which the walk first returns to the origin; similarly, the
last return time is the time at which the origin is last visited. See Fig. 1 for an example of a 1D
random walk trajectory with its first and last returns marked.
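
The definitions above can be illustrated with a short simulation. The following is a minimal sketch, not part of the original derivation: the function names `simulate_walk` and `return_times` are hypothetical, and the walk is assumed to use unit steps with right-step probability `p`.

```python
import random


def simulate_walk(n_steps, p=0.5, seed=None):
    """Return the positions r_0, ..., r_N of a 1D walk that starts at the origin.

    Each step is +1 with probability p and -1 with probability 1 - p.
    """
    rng = random.Random(seed)
    positions = [0]
    for _ in range(n_steps):
        step = 1 if rng.random() < p else -1
        positions.append(positions[-1] + step)
    return positions


def return_times(positions):
    """Return (first, last) step numbers N > 0 at which the walk is at r = 0.

    Returns (None, None) if the walk never comes back to the origin.
    """
    visits = [n for n, r in enumerate(positions) if n > 0 and r == 0]
    if not visits:
        return None, None
    return visits[0], visits[-1]


if __name__ == "__main__":
    walk = simulate_walk(25, seed=1)
    print(return_times(walk))
```

Both times are reported in units of the step interval, i.e. as step counts $N = t/\tau$.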

Assuming spatial and temporal initial conditions $(r,t) = (0,0)$, the first return time distribution
is $F(r = 0,t)$, denoted as $F(t)$ hereafter. Its cumulative distribution $\int_0^t F(t')\,dt'$ is the
probability to return to the origin by time $t$. The complement is the survival probability $S(t)$, i.e.
the probability to not return by time $t$:

$$\int_0^t F(t')\,dt' = 1 - S(t), \qquad F(t) = -\frac{dS(t)}{dt}. \tag{1}$$

$S(t)$ is found by enumerating all survival paths (those not returning to the origin) in the first $N$
steps. The probability of such a path occurring is given by the ballot theorem: In a ballot where
candidates A and B have $a$ and $b$ total votes, respectively, the probability that A is always ahead of
B throughout counting is $(a - b)/(a + b)$. To enumerate survival paths we use a proof of the ballot
theorem, known as the cycle lemma. The latter's cyclical representation of paths is ideal for
counting their partial sums and eliminating those which return to the origin, as explained below.
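
For small vote totals the ballot theorem can be checked by exhaustive enumeration. The sketch below is illustrative only: `ballot_probability` is a hypothetical helper, and it assumes $a > b$ and that every counting order is equally likely.

```python
from fractions import Fraction
from itertools import combinations


def ballot_probability(a, b):
    """Fraction of counting orders in which A is strictly ahead after every vote.

    Brute force over all placements of B's b votes among the a + b positions,
    so it is only practical for small a and b.
    """
    n = a + b
    favorable = 0
    total = 0
    for b_positions in combinations(range(n), b):
        b_set = set(b_positions)
        lead = 0
        always_ahead = True
        for i in range(n):
            lead += -1 if i in b_set else 1
            if lead <= 0:  # a tie or a deficit disqualifies this order
                always_ahead = False
                break
        total += 1
        favorable += always_ahead
    return Fraction(favorable, total)


if __name__ == "__main__":
    a, b = 5, 3
    print(ballot_probability(a, b), Fraction(a - b, a + b))
```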

Consider a circular track with $N$ numbers, a fraction of which are +1 and the rest are -1
(see Fig. 2). The numbers +1 and -1 signify a step to the right and left, respectively. Such a
configuration has $N$ possible clockwise paths along the circular track starting at each of the $N$
numbers. Consider first those survival paths which remain to the right of and never cross the
origin: the sum of numbers along such paths is always positive. Any +1 followed by a -1 may be
eliminated from the circular track as the two paths starting at either number are not in this class of
right-of-the-origin survival paths; furthermore, their removal does not affect any other path's sum
since a {+1, -1} pair's net sum is zero. Repeating this procedure until no -1's remain yields the
number of +1's in excess of -1's, i.e. the number of valid paths. The probability of choosing a
valid path from a given track is then


$$\frac{N_+ - N_-}{N} = \frac{N - 2N_-}{N}, \tag{2}$$


where $N_+$ and $N_-$ denote the number of +1's and -1's, respectively. The probability in Eq. 2 is
non-negative since $N_+ > N_-$ for a path to remain to the right of the origin. To obtain the total
number of valid paths from all possible circular track configurations, Eq. 2 is multiplied by the
number of possible {+1, -1} arrangements, $\binom{N}{N_-}$. The result is the ballot theorem: For a given
$N_-$, the number of paths remaining to the right of the origin is $\frac{N - 2N_-}{N}\binom{N}{N_-}$.
Because $N_+$ must exceed $N_-$ for the walker to remain to the right of the origin, $N_-$ can range
from 0 to $\lfloor (N-1)/2 \rfloor$, where


FIG. 1. First and last returns: The above plot is an example 1D random walk of $N = 25$ steps beginning
at the origin $r = 0$ at time $t = 0$. The ordinate and abscissa are the distance $r$ traveled and time
$t/\tau$ elapsed (number of steps taken), respectively. The first and last returns to the origin are
marked in red. In this case, the first return time is $N_{\mathrm{first}} = t/\tau = 4$, while the last
return time is $N_{\mathrm{last}} = t/\tau = 24$.
FIG. 2. Enumerating survival paths which remain to the right of the origin: $N$ numbers are written on a
circular track where each +1 and -1 denote a step to the right and left, respectively. The solid red and
dashed blue arrows represent the start of two possible clockwise paths. The red (solid) path is a survival 
path, i.e. does not return to the origin, whereas the blue (dashed) path is not a survival path as its running 
sum along the path is not always positive. The orange oval highlights a {+1,-1} pair whose constituent 
numbers do not start survival paths. Furthermore, the pair’s sum is zero and therefore it may be removed 
from the circular track without affecting any path’s running sum. 

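the latter floor function denotes the largest integer not greater than $(N-1)/2$. Summing over these
values yields the number of paths which remain to the right of the origin:

$$\sum_{N_-=0}^{\lfloor (N-1)/2 \rfloor} \frac{N - 2N_-}{N}\binom{N}{N_-} = \binom{N-1}{\lfloor (N-1)/2 \rfloor}. \tag{3}$$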
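
The cycle-lemma count behind Eq. 2 can be checked numerically: for any arrangement of the track, the number of starting points whose clockwise running sum stays positive should equal $N_+ - N_-$. The sketch below is an illustration under that reading; `valid_starts` and `random_track` are hypothetical helper names.

```python
import random


def valid_starts(track):
    """Count starting points whose clockwise running sum stays positive.

    `track` is a list of +1 and -1 entries arranged on a circle; each start
    is followed for one full lap of N = len(track) steps.
    """
    n = len(track)
    count = 0
    for start in range(n):
        running = 0
        stays_positive = True
        for k in range(n):
            running += track[(start + k) % n]
            if running <= 0:  # the path touched or crossed the origin
                stays_positive = False
                break
        count += stays_positive
    return count


def random_track(n_plus, n_minus, seed=None):
    """Return a shuffled circular track with n_plus +1's and n_minus -1's."""
    rng = random.Random(seed)
    track = [1] * n_plus + [-1] * n_minus
    rng.shuffle(track)
    return track


if __name__ == "__main__":
    for seed in range(5):
        track = random_track(7, 4, seed)
        print(track, valid_starts(track))
```

Dividing the count by $N$ gives the probability in Eq. 2 for that particular track.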



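The summation in Eq. 3 is simplified by the binomial identities
$\binom{n}{k} = \binom{n-1}{k} + \binom{n-1}{k-1}$ and $\frac{k}{n}\binom{n}{k} = \binom{n-1}{k-1}$
as follows:

$$\frac{N - 2N_-}{N}\binom{N}{N_-} = \binom{N}{N_-} - 2\binom{N-1}{N_- - 1} = \binom{N-1}{N_-} - \binom{N-1}{N_- - 1}, \qquad N_- \geq 1, \tag{4}$$

so that the sum over $N_- \geq 1$ telescopes. Furthermore, the $N_- = 0$ term is
$\binom{N}{0} = \binom{N-1}{0} = 1$, which leads to the final result of Eq. 3. By symmetry, the number
of paths which remain to the left of the origin is the same as in Eq. 3. Thus the number of survival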
paths is $2\binom{N-1}{\lfloor (N-1)/2 \rfloor}$. The survival probability $S(N)$ follows simply; each
path has probability $(1/2)^N$ and therefore

$$S(N) = 2^{-(N-1)}\binom{N-1}{\lfloor (N-1)/2 \rfloor}. \tag{5}$$
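
For small $N$ the survival-path count and Eq. 5 can be confirmed by listing every walk explicitly. This is a brute-force sketch for illustration, with hypothetical function names; it enumerates all $2^N$ walks and is therefore only practical for $N$ up to roughly 20.

```python
from itertools import product
from math import comb


def count_survival_paths(n_steps):
    """Brute-force count of N-step walks that never revisit the origin."""
    count = 0
    for steps in product((1, -1), repeat=n_steps):
        r = 0
        for s in steps:
            r += s
            if r == 0:  # returned to the origin: not a survival path
                break
        else:
            count += 1
    return count


def survival_probability(n_steps):
    """Eq. 5: S(N) = 2^{-(N-1)} * C(N-1, floor((N-1)/2))."""
    return comb(n_steps - 1, (n_steps - 1) // 2) / 2 ** (n_steps - 1)


if __name__ == "__main__":
    for n in range(1, 11):
        brute = count_survival_paths(n) / 2 ** n
        print(n, brute, survival_probability(n))
```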

In the continuum limit where $N$ is large ($t \gg \tau$), Stirling's approximation
$N! \approx \sqrt{2\pi N}\,(N/e)^N$ for Eq. 5 gives $S(t)$:

$$S(t) \approx \sqrt{\frac{2\tau}{\pi t}}. \tag{6}$$

The survival probability decays to zero for long times $t$, implying that the walk will eventually
return to the origin with probability 1. This is in accord with Pólya's recurrence theorem that
symmetric random walks return to the origin on infinite lattices of dimension $d \leq 2$.
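
The quality of the large-$N$ approximation can be gauged by comparing the exact survival probability of Eq. 5 with $\sqrt{2/(\pi N)}$, i.e. the Stirling form with $t = N\tau$. The sketch below is illustrative; the function names are hypothetical.

```python
from math import comb, pi, sqrt


def survival_exact(n_steps):
    """Exact survival probability, S(N) = 2^{-(N-1)} * C(N-1, floor((N-1)/2))."""
    return comb(n_steps - 1, (n_steps - 1) // 2) / 2 ** (n_steps - 1)


def survival_asymptotic(n_steps):
    """Large-N form sqrt(2 / (pi * N)), i.e. sqrt(2 * tau / (pi * t)) with t = N * tau."""
    return sqrt(2.0 / (pi * n_steps))


if __name__ == "__main__":
    for n in (10, 100, 1000, 10000):
        exact = survival_exact(n)
        approx = survival_asymptotic(n)
        print(n, exact, approx, exact / approx)
```

The ratio approaches 1 as $N$ grows, while both expressions decay to zero, consistent with eventual return to the origin.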



FIG. 3. Heavy-tailed distributions: The first return time distribution
$F(t) = \sqrt{\tau/(2\pi)}\, t^{-3/2}$ is shown above, where $\tau \ll t$. It is an example of a
heavy-tailed distribution, which is typical for first passage time
distributions. The average return time diverges; long return times in this distribution’s heavy tail dominate 
the average. 


By Eq. 1 the first return distribution is

$$F(t) = -\frac{dS(t)}{dt} = \sqrt{\frac{\tau}{2\pi}}\, t^{-3/2}. \tag{7}$$

It follows that the distribution's first moment, the average return time, diverges:

$$\langle t_{\mathrm{return}} \rangle = \int_0^\infty t\,F(t)\,dt \propto \int_0^\infty t \cdot t^{-3/2}\,dt = \infty. \tag{8}$$

Diverging moments are the hallmark of heavy-tailed distributions; in this case, long return times 
dominate the average. Fig. 3 shows the heavy-tailed distribution $F(t) \sim t^{-3/2}$.
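
The divergence of the mean can also be seen on the lattice. Using the discrete analogue of Eq. 1, $F(N) = S(N-1) - S(N)$, together with the exact survival probability, the mean return time truncated at a maximum step count keeps growing, roughly as the square root of the cutoff. The following sketch assumes the symmetric walk and the hypothetical function name `truncated_mean_return_time`.

```python
from math import pi, sqrt


def truncated_mean_return_time(max_steps):
    """Sum of N * F(N) over even N <= max_steps, with F(N) = S(N-1) - S(N).

    Returns are only possible at even N. For even N = 2k the exact survival
    probability is S(2k) = C(2k, k) / 4^k, which satisfies
    S(2k) = S(2k - 2) * (2k - 1) / (2k), and S(2k - 1) = S(2k - 2).
    """
    s_prev = 1.0  # S(0): no return is possible before the first step
    mean = 0.0
    for k in range(1, max_steps // 2 + 1):
        s_curr = s_prev * (2 * k - 1) / (2 * k)
        mean += 2 * k * (s_prev - s_curr)
        s_prev = s_curr
    return mean


if __name__ == "__main__":
    for cutoff in (10**2, 10**4, 10**6):
        # Second column: square-root growth estimate 2 * sqrt(cutoff / (2 * pi))
        print(cutoff, truncated_mean_return_time(cutoff), 2 * sqrt(cutoff / (2 * pi)))
```

Quadrupling the cutoff roughly doubles the truncated mean, so no finite average return time exists.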

Our derivation yields insight into last return times as well. The probability to return for the last 
time at step $2n_1$ (an even number of steps implied) is the product of the probabilities to be at the


FIG. 4. Last return times: The probability of returning to the origin for the last time at $x = n_1/n$ for $n = 100$.
Note its symmetric behavior about the minimum x = 1/2. Last returns are much more likely to occur either 
very early or late in the walk. 





FIG. 5. The arcsine law: The probability that the last return occurs within the first fraction $b$ of the full walk
duration has the form $P(0 < x < b) = (2/\pi)\arcsin(\sqrt{b})$.

origin at step $2n_1$ and of surviving $2n - 2n_1$ steps thereafter. The former is
$2^{-2n_1}\binom{2n_1}{n_1}$ since there are $\binom{2n_1}{n_1}$ ways to take an equal number of steps
right and left. For large $n_1$ this probability becomes $1/\sqrt{\pi n_1}$ by Stirling's approximation.
Multiplying by the survival probability $1/\sqrt{\pi(n - n_1)}$ yields the probability that the last
return occurs at step $2n_1$:

$$P(n_1) \approx \frac{1}{\pi\sqrt{n_1(n - n_1)}} = \frac{1}{\pi n\sqrt{x(1-x)}}, \tag{9}$$

where $x = n_1/n$. Eq. 9 is symmetric about its minimum $x = 1/2$ with singular maxima occurring at
$x = 0$ and $x = 1$ (see Fig. 4). Integrating Eq. 9 yields the arcsine law
$P(0 < x < b) = \frac{2}{\pi}\arcsin(\sqrt{b})$ as shown in Fig. 5. The arcsine law also describes the
number of positive partial sums in a sequence of mutually independent random variables from probability
distributions other than the binomial. While this rather counterintuitive result is seldom encountered
in physics texts, the law has striking consequences likely to excite physics students. Feller (see
Vol. 1, Section III.4) describes it in the context of coin-tossing games where losing and winning
sides map to the left and right of the
origin, respectively, and equalization of the fortunes signifies a return to the origin: 

The results are startling. According to widespread beliefs a so-called law of averages 
should ensure that in a long coin-tossing game each player will be on the winning side 
for about half the time, and that the lead will pass not infrequently from one player 
to the other. Imagine then a huge sample of records of ideal coin-tossing games, each 
consisting of exactly 2n trials. We pick one at random and observe the epoch of the 
last tie... With probability 1/2 no equalization occurred in the second half of the game, 
regardless of the length of the game. Furthermore, the probabilities near the end points 
are greatest... These results show that intuition leads to an erroneous picture of the 
probable effects of chance fluctuations. 
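
Feller's statement about the second half of the game can be checked with the exact lattice probabilities rather than their Stirling approximations. The sketch below is illustrative and rests on one assumption worth stating: the exact probability of surviving $2m$ steps equals the probability of being at the origin at step $2m$, which follows from Eq. 5 for even $N$. Function names are hypothetical.

```python
from math import asin, comb, pi, sqrt


def origin_probability(n_half):
    """Probability of being at the origin at step 2 * n_half: C(2k, k) / 4^k."""
    return comb(2 * n_half, n_half) / 4 ** n_half


def last_return_distribution(n):
    """P(last visit to the origin at step 2 * n1) for n1 = 0..n, walk of 2n steps.

    Product of being at the origin at step 2 * n1 and surviving the remaining
    2 * (n - n1) steps; n1 = 0 means the walk never returns.
    """
    return [origin_probability(n1) * origin_probability(n - n1)
            for n1 in range(n + 1)]


def arcsine_cdf(b):
    """Arcsine law: P(0 < x < b) = (2 / pi) * arcsin(sqrt(b))."""
    return (2.0 / pi) * asin(sqrt(b))


if __name__ == "__main__":
    n = 1000
    dist = last_return_distribution(n)
    for b in (0.1, 0.25, 0.5, 0.9):
        print(b, sum(dist[: int(b * n) + 1]), arcsine_cdf(b))
```

For odd $n$ the distribution splits exactly in half between the first and second halves of the walk, and for large $n$ the cumulative sums track the arcsine curve closely.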

The last return distribution is tied to the time spent on either side of the origin, which also
follows the arcsine law. It is highly probable to remain on one side of the origin for nearly the
entire walk, leading to long waiting times. Recent implications include hard-sphere gas particles
colliding with the same neighbors for an extended period of time. Other examples where the
arcsine law is obeyed include the time of maximal displacement in 1D Brownian motion, lead
changes within competitive team sports games, and the probability distribution of longitudinal
displacements of tracer particles in split flow.

In summary, we have reported on an elementary derivation of first and last return times which 
also serves as an introduction to a variety of important and broadly applicable concepts such as 
recurrence, first passage, heavy-tailed distributions, and the arcsine law. 